Search results for "web crawling"

showing 3 items of 3 documents

Beyond the “ivory tower”. Comparing academic and non-academic knowledge on social entrepreneurship

2021

The increasing relevance of societal challenges has recently brought social entrepreneurship to the fore due to its capacity to leverage entrepreneurial processes to achieve social value while ensuring profits. In this study, we apply an experimental research method to analyse the concept of social entrepreneurship comprehensively. More specifically, we develop bibliometric analysis and web crawling techniques to gather information related to social entrepreneurship from Scopus and Wikipedia. We conduct a comparative network analysis of social entrepreneurship’s conceptual structure at academic and non-academic levels. This analysis has been performed considering scientific articles’ keywor…

EntrepreneurshipKnowledge managementScopusSocial entrepreneurshipSocial entrepreneurship Grand challenges Bibliometric analysis Web crawling Wikipedia Network analysisSocial entrepreneurshipArticleManagement Information SystemsSettore SECS-P/10 - Organizzazione AziendaleSettore SECS-P/07 - Economia AziendaleBibliometric analysisManagement of Technology and InnovationIvory towerSociologyGrand ChallengesGrand challengeComputingMilieux_THECOMPUTINGPROFESSIONbusiness.industryDigital transformationWeb crawlingNetwork analysiGrand challengesKnowledge baseBibliometric analysiNetwork analysisbusinessCentralityWikipediaInternational Entrepreneurship and Management Journal

researchProduct

On Utilizing Stochastic Non-linear Fractional Bin Packing to Resolve Distributed Web Crawling

2014

This paper deals with the extremely pertinent problem of web crawling, which is far from trivial considering the magnitude and all-pervasive nature of the World-Wide Web. While numerous AI tools can be used to deal with this task, in this paper we map the problem onto the combinatorially-hard stochastic non-linear fractional knapsack problem, which, in turn, is then solved using Learning Automata (LA). Such LA-based solutions have been recently shown to outperform previous state-of-the-art approaches to resource allocation in Web monitoring. However, the ever growing deployment of distributed systems raises the need for solutions that cope with a distributed setting. In this paper, we prese…

Theoretical computer scienceLearning automataBin packing problemComputer scienceWeb pageContinuous knapsack problemResource allocationDistributed web crawlingResource managementResource management (computing)Web crawler2014 IEEE 17th International Conference on Computational Science and Engineering

researchProduct

Web crawling dla celów lingwistycznych. Wybrane aspekty gromadzenia i analizy danych tekstowych na przykładzie rosyjskojęzycznych newsów internetowych

2021

Autor niniejszego artykułu zgromadził ok. 2,7 mln rosyjskojęzycznych newsów internetowych. Zasadnicze cele tego tekstu stanowią: omówienie pojęcia web crawlingu w odniesieniu do pozyskiwania internetowych danych tekstowych, omówienie kwestii strukturyzacji takich danych w nieanotowanych korpusach tekstowych, a także przedstawienie wybranych aspektów analizy danych strukturyzowanych w ten sposób. Autor rozpatruje newsy internetowe jako połączenie tekstu zasadniczego oraz identyfikujących i charakteryzujących go metadanych (wyróżnionych podczas automatycznej ich ekscerpcji ze stron internetowych). Rozdział newsów na tekst zasadniczy i metadane stwarza możliwość przeprowadzenia ich analizy z d…

ogranicznik tekstucorpus of text fileszwiązki wielowyrazowetext delimiterreproduktInternet newsmulti-word expressionsweb crawlingre-productkorpus plików tekstowychquotenews internetowycudzysłówPrace Językoznawcze

researchProduct